MetaAugment: Sample-Aware Data Augmentation Policy Learning
نویسندگان
چکیده
Automated data augmentation has shown superior performance in image recognition. Existing works search for dataset-level policies without considering individual sample variations, which are likely to be sub-optimal. On the other hand, learning different samples naively could greatly increase computing cost. In this paper, we learn a sample-aware policy efficiently by formulating it as reweighting problem. Specifically, an network takes transformation and corresponding augmented inputs, outputs weight adjust loss computed task network. At training stage, minimizes weighted losses of images, while on validation set via meta-learning. We theoretically prove convergence procedure further derive exact rate. Superior is achieved widely-used benchmarks including CIFAR-10/100, Omniglot, ImageNet.
منابع مشابه
Policy Aware Geospatial Data
Digital Rights Management (DRM) prevents end-users from using content in a manner inconsistent with its creator’s wishes. The license describing these use-conditions typically accompanies the content as its metadata. A resulting problem is that the license and the content can get separated and lose track of each other. The best metadata have two distinct qualities – they are created automatical...
متن کاملVisual Data Augmentation through Learning
The rapid progress in machine learning methods has been empowered by i) huge datasets that have been collected and annotated, ii) improved engineering (e.g. data pre-processing/normalization). The existing datasets typically include several million samples, which constitutes their extension a colossal task. In addition, the state-ofthe-art data-driven methods demand a vast amount of data, hence...
متن کاملInefficiency of Data Augmentation for Large Sample Imbalanced Data
Many modern applications collect large sample size and highly imbalanced categorical data, with some categories being relatively rare. Bayesian hierarchical models are well motivated in such settings in providing an approach to borrow information to combat data sparsity, while quantifying uncertainty in estimation. However, a fundamental problem is scaling up posterior computation to massive sa...
متن کاملTowards Policy-aware Queries over Linked Data
The Linked Data principles for publishing data on the Web enable the distributed evaluation of queries, where data sources are discovered during runtime. Data sources can have associated licenses that restrict allowed usages. Besides restrictions on the access to the data sources, the usage terms can also restrict the usage terms which can be assigned to derived data artefacts, e.g. query resul...
متن کاملSemi - supervised Learning Methods for Data Augmentation
The original goal of this project was to investigate the extent to which data augmentation schemes based on semi-supervised learning algorithms can improve classification accuracy in supervised learning problems. The objectives included determining the appropriate algorithms, customising them for the purposes of this project and providing their Matlab implementations. These algorithms were to b...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2021
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v35i12.17324